A/B testing

Manipulation and cleaning data

Back to top

AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid Ver
0Photo Editor & Candy Camera & Grid & ScrapBookART_AND_DESIGN4.115919M10,000+Free0EveryoneArt & DesignJanuary 7, 20181.0.04.0.3 and up
1Coloring book moanaART_AND_DESIGN3.996714M500,000+Free0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and up
2U Launcher Lite – FREE Live Cool Themes, Hide ...ART_AND_DESIGN4.7875108.7M5,000,000+Free0EveryoneArt & DesignAugust 1, 20181.2.44.0.3 and up
3Sketch - Draw & PaintART_AND_DESIGN4.521564425M50,000,000+Free0TeenArt & DesignJune 8, 2018Varies with device4.2 and up
4Pixel Draw - Number Art Coloring BookART_AND_DESIGN4.39672.8M100,000+Free0EveryoneArt & Design;CreativityJune 20, 20181.14.4 and up
The number of duplicate we have is:483
<class 'pandas.core.frame.DataFrame'>
Int64Index:10358 entries, 0 to 10840
Data columns (total 13 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 App 10358 non-null object 1 Category 10358 non-null object 2 Rating 8893 non-null float64 3 Reviews 10358 non-null object 4 Size 10358 non-null object 5 Installs 10358 non-null object 6 Type 10357 non-null object 7 Price 10358 non-null object 8 Content Rating 10357 non-null object 9 Genres 10358 non-null object 10 Last Updated 10358 non-null object 11 Current Ver 10350 non-null object 12 Android Ver 10355 non-null object dtypes:float64(1), object(12)
memory usage:1.1+ MB
The columns with null values are:Rating - Type - Content Rating - Current Ver - Android Ver.
We can see alot of null values in the Rating column:
App 0
Category 0
Rating 1465
Reviews 0
Size 0
Installs 0
Type 1
Price 0
Content Rating 1
Genres 0
Last Updated 0
Current Ver 8
Android Ver 3
dtype:int64

========================================================================================================================

The info table we run earlier shows that the columns Price and Installs are not numbers, we need to change them do we can do arithmetic operation

PriceInstalls
0010,000+
10500,000+
205,000,000+
3050,000,000+
40100,000+
PriceInstalls
234$4.99100,000+
235$4.99100,000+
427$3.99100,000+
476$3.9910,000+
477$6.991,000+

Now we Notice 3 characters that need to be removed $ , +

PriceInstalls
2344.99100000
2354.99100000
4273.99100000
4763.9910000
4776.991000

========================================================================================================================

Examine the app category share in the platform based on the number of installs.

Back to top

AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid Ver
0Photo Editor & Candy Camera & Grid & ScrapBookART_AND_DESIGN4.115919M10000.0Free0.0EveryoneArt & DesignJanuary 7, 20181.0.04.0.3 and up
1Coloring book moanaART_AND_DESIGN3.996714M500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and up

Sorting apps category based on number of apps.

Back to top

Extend the graph of sorting to show the mean of rating, the sum of installs, size, number of reviews for each category.

Back to top

CategoryAppRatingReviewsSizeInstalls
0FAMILY19434.19115339677196931815.61.004169e+10
1GAME11214.281285141553665025593.23.154402e+10
2TOOLS8434.04741127318504412521.51.145277e+10
3BUSINESS4274.102593123581718148.48.636649e+08
4MEDICAL4084.18245013967576329.74.220418e+07
CategoryAppRatingReviews_in_MSize_in_GInstalls_in_B
1FAMILY19434.19396.7731.810.04
2GAME11214.281415.5425.631.54
3TOOLS8434.05273.1912.511.45
4BUSINESS4274.1012.368.10.86
5MEDICAL4084.181.406.30.04

=========================================================================

Making a heat map to see the relationship between Rating and size of the app and price.

Back to top

AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid Ver
0Photo Editor & Candy Camera & Grid & ScrapBookART_AND_DESIGN4.115919.010000.0Free0.0EveryoneArt & DesignJanuary 7, 20181.0.04.0.3 and up
1Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and up
AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid Ver
0Photo Editor & Candy Camera & Grid & ScrapBookART_AND_DESIGN4.115919.010000.0Free0.0EveryoneArt & DesignJanuary 7, 20181.0.04.0.3 and up
1Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and up
2U Launcher Lite – FREE Live Cool Themes, Hide ...ART_AND_DESIGN4.7875108.75000000.0Free0.0EveryoneArt & DesignAugust 1, 20181.2.44.0.3 and up
3Sketch - Draw & PaintART_AND_DESIGN4.521564425.050000000.0Free0.0TeenArt & DesignJune 8, 2018Varies with device4.2 and up
4Pixel Draw - Number Art Coloring BookART_AND_DESIGN4.39672.8100000.0Free0.0EveryoneArt & Design;CreativityJune 20, 20181.14.4 and up

Looking at app price distribution with and after removing overlays.

Back to top

Number of install distribution for the free and paid app.

Back to top

Adding review data and examine review polarity(positive-negative) base on app content rating(kids, teen, adult, everyone) and base of the type of the app(Free paid).

Back to top

AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid VerTranslated_ReviewSentimentSentiment_PolaritySentiment_Subjectivity
0Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and upA kid's excessive ads. The types ads allowed a...Negative-0.2501.000000
1Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and upIt bad >:(Negative-0.7250.833333
2Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and uplikeNeutral0.0000.000000
4Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and upI love colors inspyeringPositive0.5000.600000
5Coloring book moanaART_AND_DESIGN3.996714.0500000.0Free0.0EveryoneArt & Design;Pretend PlayJanuary 15, 20182.0.04.0.3 and upI hateNegative-0.8000.900000
AppCategoryRatingReviewsSizeInstallsTypePriceContent RatingGenresLast UpdatedCurrent VerAndroid VerTranslated_ReviewSentimentSentiment_PolaritySentiment_Subjectivity
count473644736447364.0000004.736400e+0447364.0000004.736400e+044736447364.0000004736447364473644736447364473644736447364.00000047364.000000
unique65033NaNNaNNaNNaN2NaN56420741620216083NaNNaN
topHelix JumpGAMENaNNaNNaNNaNFreeNaNEveryoneActionJuly 31, 2018Varies with device4.1 and upGoodPositiveNaNNaN
freq136515393NaNNaNNaNNaN46965NaN3632453114024125991303024229916NaNNaN
meanNaNNaN4.3502723.071857e+0632.2767468.765923e+07NaN0.043968NaNNaNNaNNaNNaNNaNNaN0.1487500.497285
stdNaNNaN0.2622597.725467e+0626.7371342.023989e+08NaN0.562501NaNNaNNaNNaNNaNNaNNaN0.3277190.232981
minNaNNaN2.7000004.600000e+011.1000001.000000e+03NaN0.000000NaNNaNNaNNaNNaNNaNNaN-1.0000000.000000
25%NaNNaN4.2000002.743900e+049.8000001.000000e+06NaN0.000000NaNNaNNaNNaNNaNNaNNaN-0.0166670.390000
50%NaNNaN4.4000003.387420e+0522.0000001.000000e+07NaN0.000000NaNNaNNaNNaNNaNNaNNaN0.1216050.509524
75%NaNNaN4.5000002.440695e+0652.0000001.000000e+08NaN0.000000NaNNaNNaNNaNNaNNaNNaN0.3500000.629048
maxNaNNaN4.9000007.815831e+07100.0000001.000000e+09NaN9.990000NaNNaNNaNNaNNaNNaNNaN1.0000001.000000